Smart Paper Notes

FedRolex: Model-Heterogeneous Federated Learning with Rolling Sub-Model Extraction

Samiul Alam , Luyang Liu , Ming Yan , Mi Zhang

Categories: Machine Learning | Computer Vision

2022-12-03

Most cross-device federated learning (FL) studies focus on the model-homogeneous setting where the global server model and local client models are identical. However, this constraint not only excludes low-end clients who would otherwise make unique contributions to model training but also restrains clients from training large models due to on-device resource bottlenecks. In this work, we propose FedRolex, a partial training (PT)-based approach that enables model-heterogeneous FL and can train a global server model larger than the largest client model. At its core, FedRolex employs a rolling sub-model extraction scheme that allows different parts of the global server model to be evenly trained, which mitigates the client drift induced by the inconsistency between individual client models and server model architectures. We show that FedRolex outperforms state-of-the-art PT-based model-heterogeneous FL methods (e.g., Federated Dropout) and reduces the gap between model-heterogeneous and model-homogeneous FL, especially under the large-model large-dataset regime. In addition, we provide a theoretical statistical analysis of its advantage over Federated Dropout and evaluate FedRolex on an emulated real-world device distribution to show that FedRolex can enhance the inclusiveness of FL and boost the performance of low-end devices that would otherwise not benefit from FL. Our code is available at https://github.com/MSU-MLSys-Lab/FedRolex.
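
The rolling extraction idea in this abstract can be illustrated with a minimal sketch. It assumes a single layer whose units are indexed 0..K-1, a client capacity given as a fraction of the layer width, and a window that advances by one unit per round with wrap-around. The names `rolling_indices` and `coverage_counts` are hypothetical and are not taken from the FedRolex repository.

```python
def rolling_indices(round_idx, layer_width, capacity):
    """Units a client trains in this round (assumed wrap-around window)."""
    # Window size: the fraction of the layer the client can afford.
    window = max(1, int(capacity * layer_width))
    # The window start advances by one unit every round.
    start = round_idx % layer_width
    return [(start + offset) % layer_width for offset in range(window)]


def coverage_counts(num_rounds, layer_width, capacity):
    """Count how often each unit is selected over a number of rounds."""
    counts = [0] * layer_width
    for round_idx in range(num_rounds):
        for unit in rolling_indices(round_idx, layer_width, capacity):
            counts[unit] += 1
    return counts


if __name__ == "__main__":
    print(rolling_indices(6, 8, 0.5))
    print(coverage_counts(8, 8, 0.5))
```

With 8 units and a capacity of 0.5, the window in round 6 wraps around to cover units 6, 7, 0, and 1. After 8 rounds every unit has been selected equally often, which is the even-training property the abstract describes.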

translated by Google Translate

Bengali Common Voice Speech Dataset for Automatic Speech Recognition

Samiul Alam , Asif Sushmit , Zaowad Abdullah , Shahrin Nakkhatra , MD. Nazmuddoha Ansary , Syed Mobassir Hossen , Sazia Morshed Mehnaz , Tahsin Reasat , Ahmed Imtiaz Humayun

Categories: Natural Language Processing

2022-06-28

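Bengali is one of the most spoken languages in the world, with over 300 million speakers globally. Despite its popularity, research into the development of Bengali speech recognition systems is hindered by the lack of diverse open-source datasets. As a way forward, we have crowdsourced the Bengali Common Voice Speech Dataset, a sentence-level automatic speech recognition corpus. Collected on the Mozilla Common Voice platform, the dataset is part of an ongoing campaign that has gathered over 400 hours of data in 2 months and is growing rapidly. Our analysis shows that this dataset has more speaker, phoneme, and environmental diversity than the OpenSLR Bengali ASR dataset, the largest existing open-source Bengali speech dataset. We present insights obtained from the dataset and discuss key linguistic challenges that need to be addressed in future versions. Additionally, we report the current performance of a few automatic speech recognition (ASR) algorithms and set a benchmark for future research.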

Bengali Handwritten Digit Recognition using CNN with Explainable AI

Md Tanvir Rouf Shawon , Raihan Tanvir , Md. Golam Rabiul Alam

Categories: Computer Vision | Machine Learning

2022-12-23

Handwritten character recognition is an active research topic. Converting a handwritten page into a text-searchable document with Optical Character Recognition (OCR) makes its content easy to access without reading the handwritten original. OCR for English is very common, but a good-quality OCR application for Bengali is hard to find. Combining machine learning and deep learning with OCR could be a major contribution to this field. Various researchers have proposed strategies for recognizing Bengali handwritten characters using many ML algorithms and deep neural networks, but explanations of their models are not available. In our work, we use various machine learning algorithms and a CNN to recognize handwritten Bengali digits. Some ML models reach acceptable accuracy, and the CNN achieves high test accuracy. Grad-CAM was used as an XAI method on our CNN model, which gave us insights into the model and helped us detect the region of interest for recognizing a digit from an image.
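
As a rough illustration of how Grad-CAM highlights the image locations that drive a prediction, the sketch below computes the standard Grad-CAM heatmap in pure Python. It assumes the activations and class-score gradients of the last convolutional layer have already been extracted as nested lists of shape [K][H][W]. The function name `grad_cam` is illustrative and this is not the authors' code.

```python
def grad_cam(activations, gradients):
    """Return an [H][W] heatmap from [K][H][W] activations and gradients."""
    height = len(activations[0])
    width = len(activations[0][0])
    # Channel weight: spatial average of the class-score gradient.
    weights = [
        sum(sum(row) for row in channel_grad) / (height * width)
        for channel_grad in gradients
    ]
    # Weighted sum over channels, followed by ReLU to keep positive evidence.
    return [
        [
            max(0.0, sum(w * act[i][j] for w, act in zip(weights, activations)))
            for j in range(width)
        ]
        for i in range(height)
    ]
```

In practice the resulting heatmap is upsampled to the input resolution and overlaid on the digit image. That step depends on the imaging library and is omitted here.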

Brain Tumor Synthetic Data Generation with Adaptive StyleGANs

Usama Tariq , Rizwan Qureshi , Anas Zafar , Danyal Aftab , Jia Wu , Tanvir Alam , Zubair Shah , Hazrat Ali

Categories: Computer Vision | Machine Learning

2022-12-04

Generative models have been very successful over the years and have received significant attention for synthetic data generation. As deep learning models grow more complex, they require large amounts of data to perform accurately. In medical image analysis, such generative models play a crucial role because the available data is limited by challenges related to data privacy, lack of data diversity, or uneven data distributions. In this paper, we present a method to generate brain tumor MRI images using generative adversarial networks. We use StyleGAN2 with the ADA methodology to generate high-quality brain MRI with tumors while using significantly less training data than existing approaches. We use three pre-trained models for transfer learning. Results demonstrate that the proposed method can learn the distributions of brain tumors. Furthermore, the model can generate high-quality synthetic brain MRI with a tumor, which can mitigate small-sample-size issues. The approach can address limited data availability by generating realistic-looking brain MRI with tumors. The code is available at: https://github.com/rizwanqureshi123/Brain-Tumor-Synthetic-Data.

SafeSpace MFNet: Precise and Efficient MultiFeature Drone Detection Network

Mahnoor Dil , Misha Urooj Khan , Muhammad Zeshan Alam , Farooq Alam Orakazi , Zeeshan Kaleem , Chau Yuen

Categories: Computer Vision

2022-11-30

The popularity of unmanned aerial vehicles (UAVs) is on the rise, as they enable services such as traffic monitoring, emergency communications, deliveries, and surveillance. However, the unauthorized usage of UAVs (a.k.a. drones) may violate security and privacy protocols for security-sensitive national and international institutions. These challenges require fast, efficient, and precise detection of UAVs irrespective of harsh weather conditions, the presence of different objects, and their size to enable SafeSpace. Recently, there has been significant progress in using the latest deep learning models, but those models have shortcomings in terms of computational complexity, precision, and non-scalability. To overcome these limitations, we propose a precise and efficient multiscale and multifeature UAV detection network for SafeSpace, i.e., MultiFeatureNet (MFNet), an improved version of the popular object detection algorithm YOLOv5s. In MFNet, we make multiple changes in the backbone and neck of the YOLOv5s network to focus on the various small and ignored features required for accurate and fast UAV detection. To further improve the accuracy and focus on the specific situation and multiscale UAVs, we classify MFNet into small (S), medium (M), and large (L) variants: these are combinations of various filter sizes in the convolution and bottleneckCSP layers, which reside in the backbone and neck of the architecture. This classification helps to reduce the computational cost by training the model on a specific feature map rather than all the features. The dataset and code are available as open source: github.com/ZeeshanKaleem/MultiFeatureNet.

MAIL: Malware Analysis Intermediate Language

Shahid Alam

Categories: Natural Language Processing

2022-11-06

This paper introduces a new language named MAIL (Malware Analysis Intermediate Language). MAIL is intended for building malware analysis and detection tools. MAIL provides an abstract representation of an assembly program and hence gives a tool the ability to automate malware analysis and detection. By translating binaries compiled for different platforms to MAIL, a tool can achieve platform independence. Each MAIL statement is annotated with patterns that a tool can use to optimize malware analysis and detection.

BanglaSarc: A Dataset for Sarcasm Detection

Tasnim Sakib Apon , Ramisa Anan , Elizabeth Antora Modhu , Arjun Suter , Ifrit Jamal Sneha , MD. Golam Rabiul Alam

Categories: Natural Language Processing | Artificial Intelligence

2022-09-27

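Bangla is one of the most widely spoken languages in the world, and its use on social media is increasing as well. Sarcasm is a positive statement or remark with an underlying negative motivation, and it is used extensively on today's social media platforms. Sarcasm detection in English has improved significantly over the past many years, but the situation for Bangla sarcasm detection remains unchanged. As a result, it is still difficult to identify sarcasm in Bangla, and the lack of high-quality data is a major contributing factor. This paper proposes BanglaSarc, a dataset constructed specifically for sarcasm detection in Bangla text. The dataset contains 5112 comments/statuses and other content collected from various online social platforms (such as Facebook and YouTube) along with a few online blogs. Because the amount of categorized Bengali comment data collected so far is limited, this dataset will aid research on identifying sarcasm, recognizing people's emotions, detecting various types of Bengali expressions, and other domains. The dataset is publicly available at https://www.kaggle.com/datasets/sakibapon/banglasarc.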

Machine Learning and Artificial Intelligence-Driven Multi-Scale Modeling for High Burnup Accident-Tolerant Fuels for Light Water-Based SMR Applications

Md. Shamim Hassan , Abid Hossain Khan , Richa Verma , Dinesh Kumar , Kazuma Kobayashi , Shoaib Usman , Syed Alam

Categories: Machine Learning | Machine Learning (Statistics)

2022-09-25

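The concept of the small modular reactor has changed the outlook for tackling future energy crises. This new reactor technology is very promising considering its lower investment requirements, modularity, design simplicity, and enhanced safety features. Artificial intelligence-driven multi-scale modeling (neutronics, thermal hydraulics, fuel performance, etc.) incorporating digital twins and the associated uncertainties is being applied in research on small modular reactors. In this work, a comprehensive study is conducted on the multi-scale modeling of accident-tolerant fuels. The application of these fuels in light water-based small modular reactors is explored. This chapter also focuses on the application of machine learning and artificial intelligence in the design optimization, control, and monitoring of small modular reactors. Finally, a brief assessment is provided of the research gaps concerning the application of artificial intelligence to the development of high burnup composite accident-tolerant fuels. The actions needed to close these gaps are also discussed.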

Impact analysis of recovery cases due to COVID19 using LSTM deep learning model

Md Ershadul Haque , Samiul Hoque

Categories: Machine Learning | Computer Vision

2022-09-06

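The present world is badly affected by the novel coronavirus (COVID-19). Identifying affected people with medical test kits is very slow. Nobody knows what will happen next. The world is facing an erratic problem and does not know what will happen in the near future. This paper attempts to make a prognosis of coronavirus recovery cases using LSTM (Long Short-Term Memory). This work uses data from 258 regions, with their latitude and longitude, and the number of deaths over 403 days ranging from 22-01-2020 to 27-02-2021. Specifically, advanced deep learning-based algorithms known as LSTM are highly effective at extracting essential features for time series data (TSD) analysis. Many methods have already been used to analyze propagation prediction. The main task of this paper culminates in analyzing the spread of coronavirus across worldwide recovery cases using LSTM-based deep learning architectures.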
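
Before a daily case series can be fed to an LSTM, it is commonly reframed as supervised pairs of a fixed-length history and the next value. The sketch below shows that sliding-window step in pure Python. The abstract does not state the framing the authors used, so the lookback length and the helper name `make_windows` are assumptions for illustration only.

```python
def make_windows(series, lookback):
    """Split a 1-D series into (history, next value) training pairs."""
    inputs, targets = [], []
    for end in range(lookback, len(series)):
        # The previous `lookback` observations form one input sequence.
        inputs.append(series[end - lookback:end])
        # The observation that follows them is the prediction target.
        targets.append(series[end])
    return inputs, targets


if __name__ == "__main__":
    # Hypothetical daily counts, not data from the paper.
    daily_counts = [1, 2, 3, 4, 5]
    print(make_windows(daily_counts, lookback=3))
```

Under this framing, a 403-day series with an assumed 7-day lookback would yield 396 training pairs per region.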

Rice Leaf Disease Classification and Detection Using YOLOv5

Md Ershadul Haque , Ashikur Rahman , Iftekhar Junaeid , Samiul Ul Hoque , Manoranjan Paul

Categories: Computer Vision | Artificial Intelligence

2022-09-04

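Rice (Oryza sativa) is a staple food in more than a hundred countries worldwide. The cultivation of rice is vital to global economic growth. However, the main issue facing the agricultural industry is rice leaf disease, which is the main cause of declining crop quality and quantity. Because farmers in many countries know little about rice leaf diseases, they cannot diagnose them properly and therefore cannot take proper care of the rice leaves. As a result, production is decreasing. A literature survey shows that YOLOv5 delivers better results than other deep learning methods. With the continual advancement of object detection technology, YOLO family algorithms, which offer very high precision and better speed, have been used in various scene recognition tasks to build rice leaf disease monitoring systems. We annotated 1,500 collected data samples and propose a rice leaf disease classification and detection method based on YOLOv5 deep learning. We then trained and evaluated the YOLOv5 model. The simulation results show improved object detection results for the augmented YOLOv5 network proposed in this paper. The required levels of recognition precision, recall, mAP, and F1 score, which serve as the performance metrics, are 90%, 67%, 76%, and 81%, respectively.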
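
For readers unfamiliar with the detection metrics listed above, the sketch below shows how precision, recall, and F1 are derived from true-positive, false-positive, and false-negative counts at a single confidence threshold. The function name `detection_metrics` and the counts in the example are hypothetical and are not taken from the paper. Metrics reported by a detection toolkit may be measured at different thresholds, so they need not satisfy the F1 identity exactly.

```python
def detection_metrics(true_pos, false_pos, false_neg):
    """Return (precision, recall, f1) for one confidence threshold."""
    predicted = true_pos + false_pos   # all boxes the detector emitted
    actual = true_pos + false_neg      # all ground-truth boxes
    precision = true_pos / predicted if predicted else 0.0
    recall = true_pos / actual if actual else 0.0
    denom = precision + recall
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


if __name__ == "__main__":
    # Hypothetical counts for illustration only.
    print(detection_metrics(true_pos=90, false_pos=10, false_neg=30))
```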
